Creating Concept Hierarchies in an Information Retrieval System
نویسندگان
چکیده
Most information retrieval systems are comprised of a focused set of domain-specific documents located within a single logical repository. A mechanism is developed by which user queries against such a system are used to generate a concept hierarchy pertinent to the domain. First, an algorithm is described which extracts terms from documents matching user queries, and then reduces this set of terms to a manageable length. The resulting terms are used to generate a feature vector for each query, and the queries are clustered using a Hierarchical Agglomerative Clustering (HAC) algorithm. The HAC algorithm generates a binary tree of clusters, which is not particularly amenable to use by humans and which is slow to search due to its depth, so a subsequent processing step applies min-max partitioning to form a shallower, bushier tree that is a more natural representation of the concept hierarchy inherent in the system.
منابع مشابه
Automatically Organising Images using Concept Hierarchies
In this paper we discuss the use of concept hierarchies, an approach to automatically organize a set of documents based upon a set of concepts derived from the documents themselves for image retrieval. Co-occurrence between terms associated with image captions and a statistical relation called subsumption are used to generate term clusters which are organized hierarchically. Previously, the app...
متن کاملThe Feasibility Study of Launching Book Recommendation System on the Basis of a Lending and Selling System of e-Books and Digital Taktab
Background:The study was conducted to achieve three axes of goals (users, publishers and the system) by way of objectives related to: A) Users - measuring the level of their satisfaction with Taktab system and also use of various methods of data retrieval; B) Publishers - Measuring the level of their satisfaction with Taktab system and also their expectations of the existence of a recommending...
متن کاملInferring User’s Information Context from User Profiles and Concept Hierarchies
The critical elements that make up a user’s information context include the user profiles that reveal long-term interests and trends, the short-term information need as might be expressed in a query, and the semantic knowledge about the domain being investigated. The next generation of intelligent information agents, that can seamlessly integrate these elements into a single framework, are enab...
متن کاملKnowledge Organisation and Information Retrieval with Galois Lattices
In this paper we investigate the application of Galois (or concept) lattices on different data sources (e.g. web documents or bibliographical items) in order to organise knowledge that can be extracted from the data. This knowledge organisation can then be used for a number of purposes (e.g. knowledge management in an organisation, document retrieval on the Web, etc.). Galois lattices can be co...
متن کاملUsing Concept Hierarchies to Enhance User Queries in Web-based Information Retrieval
The effectiveness of Internet search engines is often hampered by the ambiguity of user queries and the reluctance or inability of users to build less ambiguous multi-word queries. Our system, ARCH, is a client-side Web agent, which incorporates domainspecific concept hierarchies together with interactive query formulation in order to automatically produce a richer and therefore less ambiguous ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005